Parallel and fully recursive multifrontal sparse Cholesky

نویسندگان

  • Dror Irony
  • Gil Shklarski
  • Sivan Toledo
چکیده

We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense matrix subroutines that are part of the code, although the code can also use the blas and lapack. The new code is recursive at both the sparse and the dense levels, it uses a novel recursive data layout for dense submatrices, and it is parallelized using Cilk, an extension of C specifically designed to parallelize recursive codes. We demonstrate that the new code performs well and scales well on SMPs. In particular, on up to 16 processors, the code outperforms two state-of-the-art message-passing codes. The scalability and high performance that the code achieves imply that recursive schedules, blocked data layouts, and dynamic scheduling are effective in the implementation of sparse factorization codes. © 2003 Elsevier B.V. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel and Fully Recursive Multifrontal Supernodal Sparse Cholesky

We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a supernodal multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense-matrix subroutines that are part of the code, although the code can also use the BLAS and LAPACK. The new code is recursive at both the sparse and the dens...

متن کامل

On Evaluating Parallel Sparse Cholesky Factorizations

Though many parallel implementations of sparse Cholesky factorization with the experimental results accompanied have been proposed, it seems hard to evaluate the performance of these factorization methods theoretically because of the irregular structure of sparse matrices. This paper is an attempt to such research. On the basis of the criteria of parallel computation and communication time, we ...

متن کامل

A New Recursive Implementation of Sparse Cholesky Factorization

Consider the Cholesky factorization of a sparse symmetric positive de nite matrix, A = LL . The rst two steps use symbolic, graph-theoretic techniques to order A to reduce ll in L, and to determine the exact sparsity structure of L. The factor L is computed in a third \numeric factorization" step. The two leading schemes for numeric factorization are a blocked column-oriented scheme, and a mult...

متن کامل

Parallel Sparse Cholesky Factorization

Sparse matrix factorization plays an important role in many numerical algorithms. In this paper we describe a scalable parallel algorithm based on the Multifrontal Method. Computational experiments on a Parsytec CC system with 32 processors show that large sparse matrices can be factorized in only a few seconds.

متن کامل

A Comparison between a Direct and a Multigrid Sparse Linear Solvers for Highly Heterogeneous Flux Computations

Subsurface hydraulic properties are mainly governed by the heterogeneity of the porous medium considered. Our work aims at characterizing the asymptotic dispersion coefficients for highly heterogeneous permeability fields triggered by advection and constant local dispersion-diffusion. We have developed a fully parallel software for simulating flow and transport. We have compared two well-known ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Future Generation Comp. Syst.

دوره 20  شماره 

صفحات  -

تاریخ انتشار 2004